NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Runtime Composition of Iterations for Fusing Loop-carried Sparse Dependence

https://doi.org/10.1145/3581784.3607097

Cheshmi, Kazem; Strout, Michelle; Mehri Dehnavi, Maryam (November 2023, ACM)

Dependence between iterations in sparse computations causes inefficient use of memory and computation resources. This paper proposes sparse fusion, a technique that generates efficient parallel code for the combination of two sparse matrix kernels, where at least one of the kernels has loop-carried dependencies. Existing implementations optimize individual sparse kernels separately. However, this approach leads to synchronization overheads and load imbalance due to the irregular dependence patterns of sparse kernels, as well as inefficient cache usage due to their irregular memory access patterns. Sparse fusion uses a novel inspection strategy and code transformation to generate parallel fused code optimized for data locality and load balance. Sparse fusion outperforms the best of unfused implementations using ParSy and MKL by an average of 4.2× and is faster than the best of fused implementations using existing scheduling algorithms, such as LBC, DAGP, and wavefront by an average of 4× for various kernel combinations.
more » « less
MatRox: modular approach for improving data locality in hierarchical (Mat)rix App(Rox)imation

https://doi.org/10.1145/3332466.3374548

Liu, Bangtian; Cheshmi, Kazem; Soori, Saeed; Strout, Michelle Mills; Dehnavi, Maryam Mehri (February 2020, PPoPP '20: Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming)

Full Text Available
ParSy: inspection and transformation of sparse matrix computations for parallelism

Cheshmi, Kazem and (November 2018, SC '18 Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis)

Full Text Available
ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism

https://doi.org/10.1109/SC.2018.00065

Cheshmi, Kazem; Kamil, Shoaib; Strout, Michelle Mills; Dehnavi, Maryam Mehri (November 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis)

Full Text Available
Sympiler: transforming sparse matrix codes by decoupling symbolic analysis

https://doi.org/10.1145/3126908.3126936

Cheshmi, Kazem; Kamil, Shoaib; Strout, Michelle Mills; Dehnavi, Maryam Mehri (January 2017, SC '17 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis)

Full Text Available

Search for: All records